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Letter to the Editor 



"Homology" in Proteins and Nucleic 
Acids: A Terminology IMuddle and 
a Way out of It 



'Homology' has the precise meaning in biology of "having 
a common evolutionary origin," but it also carries the loose 
meaning of "possessing similarity or beir»g matched," Ms 
rampant use in the loose sense is clogging the literature 
on protein and nucJeic acid sequence comparisons with 
muddy writing and, in some cases, muddy thinking. 

In its precise biological meaning, "homology" is a con- 
cept o( Quality. The word asserts a type of relationship be- 
tween two or more things Thus, amino acid or nucleotide 
sequences are either homologous or they are not. They 
cannot exhibit a particular "level of homology" or "percent 
homology" Instead, two sequences possess a certain 
level of similarity. Similarity is ihus a quaniitatlve property 
Homologous proteins or nucleic acid segments can range 
from highly similar to not recognizably similar (where 
similarity has disappeared through divergent evolution). 

H using "homology" loosely did not interlere with our 
thinking about evolutionary relationships, the way in 
which we use the term would be a rather unimportant 
semantic issue. The fact Is, however, that loose usage In 
sequence comparison papers often makes it dtfficuli to 
know the author's intent and can lead to confusion for the 
reader (and even tor the author). 

There are three common situations in which hazards 
arise by using 'homology" to mean similarity. The first 
case is the most obvious offense but perhaps the least 
troublesome. Here an author Iderttifies sequence similari- 
ties (calling them homologies) but claims that the se- 
quences being compared are not evolutionarily related. 
Some awkward moments occur in such a paper, since the 
author claims both homology (i.e.. similanty) and nonho- 
mology (I.e., lack ot a common arM:estor]. Nonetheless, 
the author's ideas are Kkely to be clear since arguments 
against common ancestry are presented explicitly. 

A second case is one in which an author points out 
similarities (again called homologies) but does not ao- 
dress the issue ot evolutionary origins The reader, seeing 
the term "homology." may infer that the author is postulat- 
ing coancestfy when thai is not ih« author's intent. 

The linal case occurs mosi freqtantiy and is the rr>ost 
subtle and therefore most troublesome. Here, similarities 
(called homologies) are used to support a hypothesis of 
evolutionary homology. In this case, the two meanings of 
homology teem to overlap, and it is almost inevitable that 
the thinking of author and reader alike will be intrusively 
distorted as follows. Similarity is relatively straightforward 

take the form o1 a numerical score (% ammo acid or 
nucleotide positional identity, in the simplest approach) or 
of a probability associated with such a score. In compari- 
sons of ihree-dimensional structures, a typical numerical 



description is root-mean-squara positional deviation be- 
tween compared atomic positions A similarity, then, can 
become a lully documented, simple fact. On the other 
hand, a common evolutionary ongin must usually remain 
a hypothesis, supported by a set of arguments that might 
include sequence or three-dimensional similarity. Not all 
similarity conr»otes homology but that can be easily over- 
looked if similarities are called homologies. Thus, in this 
third case, we can deceive ourselves into thinking we have 
proved something substantial (avotulionary homology) 
when, in actuality, we have merely established a simple 
fact (a similarity, mislabeled as homology). Homology 
among similar structures is a hypothesis that may be cor- 
rect or mlstakan. but a similarity itself is a (act, however 
it is interpreted, 

We believe that the concepts of evolutionary homology 
and sequence or three-dimensional similarity can be kept 
distinct only if ihey are referred to with different words. We 
therefore ofler the following recommendations; 

•Sequence similarities (or other types of similarity) 
should simply be called similarities. Thoy should be 
documented by appropriate statistical analysis In writing 
about sequence similarities the following sorts of terms 
might be used; a level or degree o1 similarity; an alignment 
with optimized similarity; the % positional identity in an 
alignment; the probability associated with an alignment. 

•Homology should mean "possessing a common evolu- 
tionary origin" and in the vast majority of reports should 
have no other meaning. Evidence for evolutionary homol- 
ogy should be explicitly laid out, making it clear that the 
proposed relatk>nship is based on the level of observed 
similarity, the statistical signiticance of the similarity, and 
possibly other lines ot reasoning, 

One could argue that the meaning of the term "homol- 
ogy" is itself evolving. But il that evolution is toward vague- 
ness and If it results in making our scientilic discourse 
unclear, surety we should intaivene. With a collective deci- 
sion to mend our ways, proper usage would soon become 
fashionable and therefore easy. We believe that we and 
our scientific heirs would t>enefit significantly 
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Altering the Genome by Homologous 
Recombination 



Mario R. Capecchi 



Homologous recombination between DNA sequences 
residing in the dhromosome and newly introduced, 
cloned DNA sequences (gene targeting) allows the trans- 
fer of any modification of the cloned gene into the 
genome of a living cell. This article discusses the current 
status of gene targeting with particular emphasis on germ 
line modification of the mouse genome, and describes the 
different methods so far employed to identify those rare 
embryonic stem cells in whidi die desired targeting event 
has occurred. 



THE IMPLICATIONS OF THE NEW GENE TARGETING TECH- 
nology are far-reaching. If the recipient cell is a pluripotcnt, 
cmbryo-dcrivcd stem (ES) cell, it is possible to transfer a 
modification of a cloned gene, created in a test tube, to the germ line 
of a living organism (1-3). The potential now exists for modifying 
any gene, in a defined manner, in any species from which functional 
ES cells can be obtained. ES cells have been isolated from mouse and 
hamster embryos (4) and major efforts arc currently under way to 
isolate equivalent cells from domestic animals including sheep, pigs, 
and catde. In addition, because many plant cells are intrinsically 
pluripotcnt and the means exist for generating whole plants firom 
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xhesc cultured cells, wc can anticipate the application of gene 
targeting to the modification of plant genomes as well (5). 

The discussion need not, however, be limited to experiments 
directed only at gerai line modifications. In specific cases it may be 
advantageous to modify only certain somatic tissues of an organism. 
For example, as the means to propagate a variety of human somatic 
stem cells (such as hematopoietic, epirfieliaL, liver, or lung stem cells) 
become available, protocols based on gene targeting could be used 
to corrca defective genes in the appropriate human tissue. This 
scenario of human somatic gene therapy has some obvious advan- 
tages over the random insertion of a nondefective gene: for example, 
the corrected endogenous gene is much more likely to be expressed 
in die appropriate tissue at appropriate levels. Further, it should be 
possible to use this approach to correa dominant mutations. 

In addition to its implication for in vivo manipulations, gene 
targeting technology has broad potential for fundamental research 
with cells cultured in vitro. Many biological questions can be 
answered direcdy and more simply with tissue culture systems. In 
such cells, both alleles of an autosomal gene could be modified by 
the sequential application of gene targeting. Cell-lethal phenotypcs 
could be maintained and analyzed by a variety of techniques, 
including the introduction of a transgcne under the control of an 
inducible promoter. 

However, in tliis article I will emphasize experiments involving 
mouse cmbryo-dcrivcd stem cells. This choice is based on die 
interest and potential of using targeted, modified ES cells as a 
vehicle to generate mice of any desired genotype. Unfortunately, 
this choice precludes reviewing the gene targeting literature leading 
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up to these experiments. For those interested in this literature, I 
would recommend the 1984 Cold Spring Harbor Symposium on 
homolog(nis recombination (6) and a recent review (7). I will also 
discuss various approaches being used to modify nonselectabic 
genes and venture some guesses as to where the field is likely to 
progress. 



From ES Cells to Germ Line Chimera 

Figure 1 outlines the procedure for introducing a designed 
mutation into the germ line of mice by means of targeted modifica- 
tion of the ES cell genome. A targeting vector, containing the 
desired mutation, is introduced into ES cells by elcctroporation or 
microinjection. In nKwt cells the targeting vector inserts randomly 
into the ES genome. However, in a few cells, the targeting vector 
pairs with the cognate chromosomal DNA sequence and transfers 
the mutation to the genome by homologous recombination. Screen- 
ing or enrichment procedures (or both) are then used to identify the 
rare ES cell in which the targeted event has occurred. The appropri- 
ate cell is then cloned and maintained as a pure population. Next, 
the altered ES cells are injected into the blastocoel cavity of a 
preimplantation mouse embryo and the blastocyst is surgically 
transferred into the uterus of a foster mother where development is 
allowed to progress to term. The resulting animal is chimeric in that 
it is composed of cells derived firom both the donor stem cells and 
the host blastocyst In die particular example shown in Fig. 1, the 
ES cells ate derived from a mouse homozygous for die black coat 
color allele and the recipient blastocyst is derived fiom an albino 
mouse. ITic fiir of the resulting chimeric mouse has patches of both 
colors because the mouse contains cells of both genotypes. Breeding 
of the chimeric mouse to an albino mouse yields some black mice, 
indicating that the ES cells contributed to the formation of the germ 
line. Genomic screening of these progeny is used to determine 
which mice received the allele carrying the targeted mutation. 
Interbreedii^ of heterozygous siblings yields aninuls homozygous 
for the desired mutation. 



Disrupdon of hprt 

Mamnlalian cells can mediate recombination between homolo- 
gous DNA sequences but they demonstrate an even greater disposi- 
tion for mediating nonhomologous recombination. The problem is 
dius to identify homologous recombination events in a vast pool of 
scattered, nonhomologous recombination events. The hypoxanthine 
phosphoribosyl transferase gene {hprt) has provided an ideal model 
system for developing the technique of gene targeting in ES cells, 
because one can select dirccdy for the targeting event. Since this 
gene is on the X chromosome, only one mutant copy is needed to 
yield the recessive hprt~ phcnotypc in male ES ceUs. The kprt~ cells 
arc scieacd by growth in the presence of the base analog, 6 
thioguaninc (6^TG), which kills hprt+ cells. 

Figure 2 illustrates the use of sequence replacement and sequence 
insertion vcaors to disrupt hprt (S). Using yeast as a paradigm, we 
anticipated that sequence replacement vectors would replace endog- 
enous DNA with exogenous sequences, whereas sequence insertion 
vectors would insert the entire veaor DNA sequence into the 
endogenous locus. Since the final products are predicted to be 
different when these two classes of vectors are used (note the partial 
duplication of the gene in Fig. 2B), each vector could generate 
different types of mutant alleles. The presence of the gene encoding 
neomycin phosphotransferase (neo) within the eighth exon of hprt 
disrupts the hprt coding sequence and also provides a selectable 



Fig. 1. Generation of 
mouse germ line chinK- 
ras fix>m embryo-derived 
stem (ES) cells contain- 




Germ-line transmission of 
ES cell genome containing 
targeted modification - 



marker (resistance to the drug G4I8). 

Both types of targeting vector were introduced into ES cells by 
electroporation. The transfcctants that svurived selection in G418 
and 6-TG had lost hprt activity as a result of a targeted disruption of 
hprt. Both types of vectors were equally efficient at disrupting- the 
endogenous hprt (8). Furthermore, replacement and insertion vec- 
tors showed the same dependency of die targeting ftcquency on th,c 
extent of homology between the targeting vector and endogenous 
DNA sequences (Fig. 3). (The word "homology" is used here to 
describe participants in homologous recombination, which ace 
generally identical.) Over the range tested, firom 2.9 to 14.3 kb, 
fivefold increase in DNA sequence homology resulted in roughly-a 
100-fold increase in the targeting frequency. In thc.above experi- 
ments, the amount of nonhomology {neo) being transferred to tht 
target was kept constant. Therefore, it has not been determined 
whether the critical parameter is the absolute extent of sequence 
homology between the incoming DNA and the target, or whether 
the relative amount of nonhomology is also important. However, 
with die largest targeting veaor the absolute targeting firequency 
was one independent targeting event per 3 x 10* ES cells electro- 
porated. 

Smithies and his colleagues have corrcaed a defective hprt in ES 
cells using a sequence insertion vector (9) and have inactivated hprt 
with a sequence replacement vector (10). In the former experiments 
the recipient ES cells contained a spontaneous 5' deletion in hprt and 
die incoming targeting vector supplied the missing exons. In the 
latter experiments the replacement-type veaor contained only 1.3 
kb of hprt sequence homology disrupted by a promotcrless neo. As 
will be discussed shortiy, use of a promotcrless neo in die targeting 
vector yields an enrichment for homologous versus nonhomologous 
recombination events. More recently Thompson et al. (3) generated 
germ line chimeras from ES cells in which a mutant hprt had been 
corrcaed by gene targeting. This demonstrates that ES cells trans- 
fcaed by electroporation and subjcacd to HAT selection still retain 
their ability to contribute to a functional germ line. 

It has not been straightforward to make quantiutive comparisons 
among hprt targeting experiments done in different laboratories. 
Targeting veaors containing different extents of honrology to the 
endogenous target were used. In addition, the targeting vectors have 
been directed to different regions of hprt. The simplest means for 
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cbniparison is in terms of absolute fiequcncy of homologous 
recombination. How many cells were clcctroporatBd and how many 
independent targeting events were recovered? With this criterion for 
comparison and with allowance for the difiering extents of homolo- 
gy to the target, the results from the different laboratories are in 
moderate agreement [When targeting vectors containing approxi- 
mately 2 to 4 kb of hprt homology were used, absolute targeting 
frequencies in the range of one event per 5 x 10^ to 5 x 10* treated 
cells were reported {3, 8-W)]. Altctnativciy, it is possible to 
compare the ratio of homok^us recombination events to nonho- 
mologous recombination events. Unfortunately this ratio is depen- 
dent on the method used to estimate the frequency of nonh(Mnolo- 
gous recombination. As an extreme example, if the comparison was 
made to the transfection efficiency measured with a completely 
defective selectable gene, this ratio would be iniSnitcly large. Indeed 
a wide range of ratios have been reported, from 0.001% to greater 
than 10%. It should also be pointed out that even when the same 
selectable gene is used, in the same nucleotide environment, mea- 
surements of both the targeting and random insertion frequency 
may be less than ideal, since expression of the selectable gene will 
inevitably be different at the target locus relative to random lod. 
Thus, the ratio will be dependent on the thresholds set by the 
selection conditions. 



Nomelectable Genes 

The advantages of hprt — its presence as a single copy in male cells 
and the ability to use direa selection as a way of isolating homok> 
gous recombinants — are not the case for most genes of interest. 
Indirect enrichment or screening procedures must be used to 
identify the rare ES cell in which a nonselectablc gene has been 
inactivated. 

A very sensitive screening method makes use of the polymerase 
chain reaction (PGR) (11) to specifically amplify a novel DNA 
junction created by the targeting event {12). Two groups have 
recently reported success using this approach t» screen pools of 
transfcctants and detect the rare ES ceil in which the targeting event 
occunrd. Joyner et al. have disrupted the homeobox-containing 
gene en-2 in ES cells by means of a replacement vector in which one 
of the en-2 exons was interrupted by neo (li). After introducing the 



m-2/neo' targeting vector into ES cells by clcctroporation and 
selecting for G418' cells, the authors screened pools of 0418"^ 
colonics by PCR to specifically amplify the «*o-disrupted en-2. They 
detected approximately one targeting event per 300 G418'' colonies. 

2^inuncr and Gruss have disrupted another nxxisc honteobox- 
containing gene, hox 1.1 (14). They used a replacement vector in 
which the homcobox domain carried a 20-bp insert, which disrupt- 
ed the coding sequence and created a novel hybridization site for 
one of the PCR primers. The targeting vector was introduced into 
ES cells by microinjection. Since no selectable marker was used, ES 
cells in which a targeting event had occurred were identified solely 
by the presence of a specific PCR product. Only homologous 
recombination would juxtapose the two PCR primers, one within 
the targeting vector and the other from flanking /lox 1.1 sequences in 
the endogenous gene. Approximately 1 in 150 cells receiving an 
injection yielded the predicted amplified PCR fragment. 

It is also possible to use the ds-acting regulatory scqueiKes of the 
target gene in order to enrich for successful rccombbation events. 
In this case, the targeting vector is designed in such a way that 
expression of the selectable marker, neo, depends on homologous 
integration to supply a missing promoter or enhancer, as described 
by Jasin and Berg (15) as well as Sedivy and Sharp (16). In this way, 
a several hundredfold enrichment for targeted intentions relative 
to randcHn insertion of the vector was obtained. This strategy only 
applies to genes that are expressed in the recipient cell line. 



The Positive-Negative Selection (PNS) 
Procedure 

Recently, we have described an enrichment procedure that is 
independent of die fiinction of the target gene (17). This procedure 
uses a positive selection for cells diat have incorporated the targeting 
vector anywhere in the ES cell genome and a native selection 
gainst ceils that have randomly integrated the vector. The net effect 
is to enrich for cells containing the desired targeted mutation. The 
vector contains 10 to IS kb of DNA homologous to the target gene, 
a neo inserted, abi^ with its own strong promoter, into an cxon of 
that sequence; and thymidine kinase gene fiom herpes simplex virus 
(HSV-tk) adjacent to the region of homology. 

This vector was designed so that honK>logous recombination will 



Fig. 2. Disruption of Aprt 
by gene targeting with (A) 
a set^uence replacement 
targeting vector or (B) a 
sequence insertion target- 
ing vector. Vectors of 
both classes contain hprt 
sequences interrupted in 
the eighth cxon with neo 
gene. With the sequence 
replacement vector, after 
homologous pairing be- 
tween me vector tnd ge- 
nomic sequences, a recom- 
bination event replaces the 
genomic sequerKe with 
vector sequences contain- 
ing neo. Sequence inscr- 
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n suit in die HSV-tk gene not being transferred into tljc target locus. 
■ l Uc HSV-tk gene is lost during the process of homobgous 
c ombination because it is kxatcd distal to die region of homology 
K I ween the vector and the target. Only cells in which random 
iiiicgration of the targeting vector has occurred will be able to retain 
I he HSV-tk gene. This is prediaed from the observation diat most 
r.iiidom insertions of exogenous, linearized DNA into the genome 
Dccur via their ends (18). Therefore, by using G418 to selector 
u lls that contain a fimctional neo' gene and by using gancyclovir to 
si lca against cells that contain a fimctional HSV-dc gene, we can 
enrich for cells in which the targeting event has occurred. 

We have tested the above PNS approach by using it to disrupt the 
mouse hprt, mt-2, hox 1.2, and hox 1.3 genes {17, 19) and observed 
.IS much as a 2000-fold enrichment for targeted disrupdons. 



Targeting iirto Genes Not Expressed in ES Cells 

An ideal strategy for disrupting a gene not expressed in ES cells 
has not yet emerged. Preliminary attempts in which the PNS 
procedure was used to mutate one such gene, the proto-oncogcne 
i>ir-l, suggests that the targeting frequency is much lower than that 
obtained at the hprt, int-2, hox 1.2, and hox 1.3 loci (20). It may be 
possible to increase the enrichment factor by strategies designed to 
enhance the stringency of either the positive or negative component 
of the selection. As an exanqjlc of the latter, to reduce the frequency 
of HSV-tk loss during transfccdOT, two HSV-tk genes could be 
inserted into the targeting vector, one at each aid, and then at least 
one should survive transfection to permit sclecdon against cells 
containing random integrations. Placement of a large block of 
nonhomology at both ends of the linearized targeting vector, as is 
requited to construa a double tk vector, docs not affca the 
frequency of homologous recombination (20). 

Among alternative approadics to consider are the use of pure 
screening procedures or enrichment procedures. Embedded within 
the decision of whether to use nonselative or selective protocols is 
die choice of whether to deliver the targeting veaor to the recipient 
cells by microinjection or electroporation. 

Microinjection into the nucleus results in very high frequencies of 
stable transfectants (10 to 20% of the cells receiving DNA) (21). 
However, only a single cell at a time can be microinjcctcd, whereas 
electroporation allows large numbers of cells to be simultaneously 
transfected. After electroporation ~1% of ES ceils are stable trans- 
fiKtants (17). The choice of transfection protocol should be deter- 
mined, in part, by the method chosen to identify and isolate the ES 
cells in which the desired targeting event has occurred. Methods that 
rely solely on screening are dependent on the absolute targeting 
frequency and therefore microinjection should be considered. On 
the other hand, since enrichment procedures rely on selection they 
arc not as dependent on the absolute targeting frequency and can 
take advantage of mass transfection protocols. 

As already discussed, Zimmer and Gniss (14) used microinjection 
to deliver die targeting vector and reported that 1 in 150 cells 
receiving an injection of an altered hox 1 . 1 genomic fragment yielded 
a disrupted hox 1.1 gene. The reasons for this very high frequency of 
homologous recombination are not clear, but some or all of the 
following factors may have played a role: (i) delivery of the DNA by 
microinjection, (ii) absence of a selection protocol to isolate the ES 
cells in which a targeting event had occurred, (iii) disruption of the 
hox 1.1 genomic fragment in the targeting vector with only a small 
(20-bp) insert, or (iv) a hot spot for recombination at the hox 1.1 
locus in ES cells. 

Intrachromosomai gene conversion experiments in cultured mam- 
malian cells suggest that the frequency of recranbination events is 



inversely related to the length of nonhomology that must be 
corrected (22). Therefore, small disruptions in die targeting vector 
may &vot higher targeting frequencies. This parameter has not, 
however, been directly measured in gene targeting experiments. 
Since tte DNA substrates participating in intradiromcMomal and 
gOK targeting reactions may be quite different, extrapolating the 
results from one set of experiments to the other could be misleading. 

On the basis of the above discussion, a number of alternative 
approaches can be suggested that may allow disruption of a gene not 
expressed in the recipient cell. Among these alternatives are: (i) 
microinjection of the targeting vector into the recipient cells fol- 
towed 1^ PGR to screen for die transfix' of a small oligonucleotide 
insert to the desired target gene, (ii) electroporation of die targeting 
vector followed by PGR or (iii) electroporation of the targeting 
vector followed by an enrichment procedure, such as PNS, to select 
cell lines in which the targeting event has occurred. Each approach is 
associated with its own risks. If the targeting frequency with 
microinjection is in the range of one event per 10* cells, then this 
procedure is extremely labor-intensive. The absolute fixquency of 
targeting events by the second procedure could be less dian one 
event per 10* electroporatcd cells. Under these conditions, it 
becomes a challenge to use PGR to identify the targeting event. 
Finally, concerning the third approach, it is possible that the 
chromosomal environment surrounding a silent gene in ES cells may 
repress expression of the selectable marker. Under these conditions, 
enrichment procedures such as the described PNS procedure cannot 
be used. 



The Future of Gene Targeting 

As dificrcnt methods of identifying successful targeting events are 
applied to more genes, their strengths and limitations will become 
evident. Gurrent techniques should permit the generation of mouse 
mutants in many, if not all, genes. In particular, if the gene is 
expressed in ES cells, dien enrichment procedures should allow the 
desired, targcted-modified ES cells to be found among a small 
number of selected clones (1 to 20). If, on the other hand, 



Rg. 3. The targeting frtquency 
at the hprt bcus as a function of 
the extent of homology be- 
tween the targeting vector and 
the endogenous target. A scries 
of replacement. A, and inser- 
tion, O, hprt targeting vectors 
were consmicted that varied in 
the extent of homology to the 
endogenous hprt. Hprt se- 
quences in each vector were in- 
in the eighth exon 
The amount of hprt 
3' to MO was kept 
constant and the amount of hprt 
sequence 5' to nro varied. TTie 
conditions for electroporai 



1 ftw G4l8^-6-TG' 
ES cells were as described (7). 
AU G418'-6-TG' ccU lines were 
hprt- as a result of targeted dis- 
of hprt. Independence 



ruption 
of the 



by only scoring 
G418'-6-TG' coboies rfiat 
arose on separate plates. The 
absolute targeting frequency 
number of independent targeting 



ts per total ES cells electroporatcd is 



plotted on a logarithmic scale as a function of the number of fcilobascs of hprt 
sequence contained within the targeting vr^" 



I6 junn 1989 



ARTICLES Upl 



expression of the gene of interest is not detectable in ES cells, the 
experimental options are less well defined. 

With the above proviso, the ftiture of gene targeting in mammali- 
an systems appears very bright. Generation of specific mouse 
mutations via gene targeting should have a major impaa on all 
phases of mammalian biology, including development, cancer, 
immunology, neurobiology, and human medicine. For example, 
recent molecular genetic analysis of development in Drosophila has 
revealed a network of genes that control the formation of its 
metamcric pattern (23). On the basis of DNA sequence similarity, 
related genes, such as the hox genes, have been identified in the 
mouse (24). The embryonic expression patterns of dicse genes imply 
roles in establishing positional information during development. 
How closely the function of the mouse genes may parallel the 
fimction of the Drosophila homologs, if at all, remains to be 
determined. Targeted disruption of these genes may not only reveal 
the phenotypes associated with the inactivation of the individual 
genes, but, through epistasis and molecular analyses, may also help 
define the developmental network controlling early mouse morpho- 
genesis. 

Molecular analysis of tumors and transformed cells has revealed a 
plethora of genes contributing to malignant growth. The normal 
function of these proto-oncogcnes is cutrently deduced fix>m such 
features as their protein products, their cellular compartmentaliza- 
tion, and dieir expression pattern. Genetic dissection of their 
function in the mouse or in tissue culture systems should permit a 
more precise definition of the normal function of these genes, and 
potentially, a better appreciation of the role of oncogenes in causing 
malignancy. The development of genetically engineered mice in 
which the effects of deficiencies in anti-oncogenes, such as the 
retinoblastoma gene, could be studied would prove of great experi- 
mental value. On the other hand, the genetic fimction of some of the 
more ubiquitously expressed proto-oncogcnes, such as myc, Jos, and 
jun, may be more dearly defined in tissue culture systems rather than 
in the intaa animal. ES cells could be a particuiarly attractive cell 
culture line for such an analysis. These cells are euploid and stable 
widi respea to karyotype. Further, they can be induced to difictenti- 
ate in vitro into many different ccli types. If these factors participate 
in making early cell lineage decisions, then loss of function alleles 
could lead to a restriction in the spectrum of final difercntiated cell 
types. 

Immunology appears to be a particularly fertile field for genetic 
analysis by gene targeting. It is by for die best charaaerized cellular 
system in mammals, and extensive molecular analysis has identified 
many of the genetic components responsible for immunological 
diversity. Further, since the immune system is dispensable, analysis 
of many of the null phenotypes should be simplified. 

Approximarely 3500 different human genetic diseases are known. 
As the genes responsible for these diseases are identified and cloned, 
disruption of the corresponding genes in the mouse should provide 



uscfiil models for these human diseases. Such models will focilitatc 
analysis of die pathology of the disease and provide a system for the 
exploration of new therapeutic protocols including gene therapy. 

A new arena for biotechnology is the application of transgcncsis 
to the improvement of domestic animals and plants, as well as for 
the production of rare products such as pharmaceuticals within 
domestic animals (as discussed in this issue). These cfcrts will 
certainly be complemented and augmented by the use of gene 
targeting to modify die host genome. Transgcncsis and gene 
targeting are often directed towards different ends, the former being 
used to gain new functions, the latter being used to augment or to 
generate loss of existing functions. As regulatory loops become 
better defined, it should become possible to alter such loops by gene 
targeting, thereby reducing the production of undesircd products 
(such as fet content in meat) or increasing the production of desired 
products (such as pharmaceuticals in milk). The power of gene 
targeting resides in die ability of the experimenter to precisely 
choose both die gene to be modified and the specific change to be 
introduced. 
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